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Abstract. Let G = (V, E) be any undirected graph on V vertices and E edges. A path P 
between any two vertices u, v £ V is said to be i-approximate shortest path if its length is 
at most t times the length of the shortest path between u and v. We consider the problem 
of building a compact data structure for a given graph G which is capable of answering 
the following query for any u,v, z £ V and t > 1. 

report t-approximate shortest path between u and v when vertex z fails 
We present data structures for the single source as well all-pairs versions of this problem. 
Our data structures guarantee optimal query time. Most impressive feature of our data 
structures is that their size nearly match the size of their best static counterparts. 



1. Introduction 

The shortest paths problem is a classical and well studied algorithmic problem of com- 
puter science. This problem requires processing of a given graph G = (V,E) on n = \V\ 
vertices and m = \E\ edges to compute a data structure using which shortest path or dis- 
tance between any two vertices can be efficiently reported. Two famous and thoroughly 
studied versions of this problem are single source shortest paths (SSSP) problem and all- 
pairs shortest paths (APSP) problem. 

Most of the applications of the shortest paths problem involve real life graphs and 
networks which are prone to failure of nodes (vertices) and links (edges). This has motivated 
researchers to design dynamic solution for the shortest paths problem. For this purpose, one 
has to first develop a suitable model for the shortest paths problem in dynamic networks. 
In fact two such models exists, and each of them has its own algorithmic objectives. 

The shortest paths problem in the first model is described as follows : There is an 
initial graph followed by an on-line sequence of insertion and deletion of edges interspersed 
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with shortest path (or distance) queries. Each query has to be answered with respect to the 
graph which exists at that moment (incorporating all the updates preceding the query on 
the initial graph). A trivial solution of this problem is to recompute all-pairs shortest paths 
from scratch after each update. This is certainly a wasteful approach since a single update 
usually does not cause a huge change in the all-pairs distance information. Therefore, the 
algorithmic objective here is to maintain a data structure which can answer distance query 
efficiently and can be updated after any edge insertion or deletion in an efficient manner. In 
particular, the time required to update the data structure has to be substantially less than 
the running time of the best static algorithm. Many novel algorithms have been designed 
in the last ten years for this problem and its variants (see [6] and the references therein). 

On one hand the first model is important since it captures the worst possible hardness 
of any dynamic graph problem. On the other hand, it can also be considered as a pessimistic 
model for real life networks. It is true that the networks are never immune to failures. But 
in addition to it, it is also rare to have networks which may have arbitrary number of failures 
in normal circumstances. It is essential for network designers to choose suitable technology 
to make sure that the failures are quite infrequent in the network. In addition, when a 
vertex or edge fails (goes down), it does not remain failed/down indefinitely. Instead, it 
comes up after some finite time due to simultaneous repair mechanism going on in the 
network. These aspects can be captured in the second model which takes as input a graph 
and a number f Cn. This model assumes that there will be at most I vertices or edges 
which may be inactive at any time, though the corresponding set of failed vertices or edges 
may keep changing as the time progresses : the old failed vertices become active while some 
new active vertices may fail. The algorithmic objective in this model is to preprocess the 
given graph to construct a compact data structure which for any subset S of at most I 
vertices may answer the following query for any u,v E V . 

Report the shortest-path (or distance) from u to v in G\S. 

It is desired that each query gets answered in optimal time : retrieval of distance in 
O(l) time and the shortest path in time which is of the order of the number of its edges. 
The ultimate research goal would be to understand the complexity of the above problem 
for any given value I. In this pursuit, the first natural step would be to efficiently solve 
and thoroughly understand the complexity of the problem for the case I = 1, that is, the 
shortest paths problem avoiding any failed vertex. Interestingly, this problem appears as a 
sub problem in many other related problems, namely, Vickrey pricing of networks [9J, most 
vital node of a shortest path the replacement path problem [12] , and shortest paths 
avoiding forbidden subpaths pp. 

The first nontrivial and quite significant breakthrough on the all-pairs version of this 
problem was made by Demetrescu et al. [7J. They designed an 0(n 2 log n) space data 
structure, namely distance sensitivity oracle, which is capable of reporting the shortest path 
between any two vertices avoiding any single failed vertex. The preprocessing time of this 
data structure is 0(mn 2 ). Recently, Bernstein and Karger [3] improved the preprocessing 
time to 0(mn log n). Though 0(n 2 logn) space bound of this all-pairs distance sensitivity 
oracle is optimal up to logarithmic factors, it is too large for many real life graphs which 
appear in various large scale applications [13]. In most of these graphs usually m <^ n 2 , 
hence a table of 0(n 2 ) size may be too large for practical purposes. However, it is also known 
[7J that even a data structure which reports exact distances from a fixed source avoiding a 
single failed vertex will require Q(n 2 ) space in the worst case. So approximation seems to 
be the only way to design a small space compact data structure for the problem of shortest 
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paths avoiding a failed vertex. A path between u, v € V is said to be t-approximate shortest 
path if its length is at most t times that of the shortest path between the two. The factor 
t is usually called the stretch. We would like to state here that many algorithms and data 
structures have been designed in the last fifteen years for the static all-pairs approximate 
shortest paths (see [21 [13] and references therein) . The prime motivation underlying these 
algorithms has been to achieve sub-quadratic space and/or sub-cubic preprocessing time 
for the static APSP problem. However, no data structure was designed in the past for 
approximate shortest paths avoiding any failed vertex. 

In this paper, we present really compact data structures which are capable of reporting 
approximate shortest paths between two vertices avoiding any failed vertex in undirected 
graphs. The most impressive feature of our data structures is their nearly optimal size. In 
fact their size almost matches the size of their best static counterparts. 

1.1. New Results 

Single source approximate shortest paths avoiding any failed vertex. 

First we address weighted graphs. For the weighted graphs, we present an 0(m log n) time 
constructible data structure of size 0(n log n) which can report 3-approximate shortest 
path from the source to any vertex v £ V avoiding any x GV. We then consider the case 
of undirected unweighted graphs. For these graphs, we present an 0(n^§^) space data 
structure which can even report (1 + e)-approximate shortest path for any e > 0. 
All-pairs approximate shortest paths avoiding any failed vertex. 
Among the existing data structures for static all-pairs approximate shortest paths, the 
approximate distance oracle of Thorup and Zwick [13] stands out due to its amazing features. 
Thorup and Zwick [13] showed that an undirected graph can be preprocessed in sub-cubic 
time to build a data structure of size 0(kn l+l / k ) for any k > 1. This data structure, despite 
of its sub-quadratic size, is capable of reporting (2k — l)-approximate distance between any 
two vertices in 0(k) time (and the corresponding approximate shortest path in optimal 
time), and hence the name oracle. Moreover, the size-stretch trade off achieved by this data 
structure is essentially optimal. It is a very natural question to explore whether it is possible 
to design all-pairs approximate distance oracle which may handle single vertex failure. We 
show that it is indeed possible for unweighted graphs. For this purpose, we suitably modify 
the approximate distance oracle of Thorup and Zwick [13] using some new insights and our 
single source data structure mentioned above. These modifications make the approximate 
shortest-paths oracle of Thorup and Zwick handle vertex failure easily, and (surprisingly) 
still preserving the old (optimal) trade-off between the space and the stretch. For precise 
details, see Theorem 15.31 

For the algorithmic details missing in this extended abstract due to page limitations, 
we suggest the reader to refer to the journal version [10]. Our data structures can be easily 
adapted for handling edge failure as well without any increase in space or time complexity. 

2. Preliminaries 

We use the following notations and definitions in the context of a given undirected 
graph G = (V, E) with n = \V\, m = \E\ and a weight function oj : E — > R + . 

• T r : single source shortest path tree rooted at r. 

• P(x,y) : the shortest path between x and y. 
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• 8(x,y) : the length of the shortest path between x and y. 

• P(x,y,z) : the shortest path between x and y avoiding vertex z. 

• 5(x,y,z) : the length of the shortest path between x and y avoiding vertex z. 

• T r (x) : the subtree of T r rooted at x. 

• G r (x) : the subgraph induced by the vertices of set T r (x) and augmented by vertex 
r and edges from r as follows. For each v £ T r (x) with neighbors outside T r (x), 
keep an edge (r,v) of weight = mm^ v)eEtU ^ Tr{x) (5(r,u) +uj(u,v)). 

• P :: Q : a path formed by concatenating path Q at the end of path P with an edge 
(u, v) £ E, where u is the last vertex of P and v is the first vertex of Q. 

• E(X) : the set of edges from E with at least one endpoint in X. 

Our algorithms will also use a data structure for answering lowest common ancestor (LCA) 
queries on T r . There exists an 0(n) time computable data structure which occupies 0(n) 
space and can answer any LCA query in O(l) time (see [3] and references therein). 

3. Single source 3-approximate shortest paths avoiding a failed vertex 

We shall first solve a simpler sub-problem where the vertex which may fail belong to 
a given path P £ T r . Then we use divide and conquer strategy wherein we decompose T r 
into a set of disjoint paths and for each such path, we solve this sub-problem. 

3.1. Solving the Sub-Problem : the failures of a vertex from a given path P(r,t) 

Given the shortest path tree T r , let P(r,i) = (r(= xq), xi, Xk(= t)) be any shortest path 
present in T r . We shall design an 0(n) space data structure which will support retrieval of a 
3-approximate shortest path from r to any v € V when some vertex from P(r, t) fails. The 
preprocessing time of our algorithm will be 0(m + n log n) which matches that of Dijkstra's 
algorithm. The algorithm is inspired by the algorithm of Nardelli et al. for computing 
the most vital vertex on a shortest path. Consider vertex Xi lying on the path P(r, t). We 



r 




Figure 1: Partitioning of the shortest path tree T r at x% € P 

partition the tree T r \{xi} into the following 3 parts (see Figure [T]). 

(1) Ui : the tree T r after removing the subtree T T (xi) 

(2) Di : the subtree of T r rooted at xi + \ 

(3) Oi : the portion of T r left after removing Ui, Xi, and Di. 

Note that a vertex of the tree T r is either a vertex of the path P (r, t) or it belongs to some 
Oi for some %. We build the following two data-structures of total O(n) size. 
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(1) a data structure to retrieve 3-approximate shortest path from r to any v G Di. 

(2) a data structure to retrieve 3-approximate shortest path from r to any v G Oj. 

3.1.1. Data structure for 3-approximate shortest paths to vertices of Di when X{ has failed. 
Consider the vertex Xj+i and any other vertex y G Di. Note that the shortest path P(xj+i, y) 
remains intact even after removal of Xi, and its length is certainly less than 5(r,y). Based 
on this simple observation one can intuitively see that in order to travel from r to y when 
Xi fails, we may travel along shortest route to Xj+i (that is P(r, Xj+i, Xi)) and then along 
P(xj_i_i,y). Using triangle inequality and the fact that the graph is undirected, the length 
of this path P(r, Xj+i, x{) :: P(xi+i,y) can be approximated as follows. 

S(r,x i+1 ,Xi) + 6(x i+1 ,y) < 6(r,y,Xi) + S(y,x i+1 ,Xi) + 6(x i+1 ,y) 

< S(r,y,Xi) + 26(x i+1 ,y) 

< 5(r,y,Xi) + 25(r,y) <35(r,y,Xi) 

Therefore, in order to support retrieval of 3-approximate shortest path to any v G Di in 
optimal time, it suffices to store the path P(r, Xj+i, Xi). 

In order to devise ways of 6ffici6nt computation and compact storage of P(r ; , x^) 
for a given %, we use the following lemma about the structure of the path ~P(r, x^i, x^). 

Lemma 3.1. The shortest path P(r, Xj+i,Xj) is of the form P\ :: P2 where Pi is a shortest 
path from r in the subgraph induced by [/j U Oi, and Pi is a path present in Di. 

It follows that in order to compute P(r, Xj+i, Xj), first we need to compute shortest paths 
from r in the subgraph induced by Ui U Oj. Let Si(r, v) denote the distance from rtoi)£ 
Ui U Oi in this subgraph. Note that 5i(r, v) for v G Ui and the corresponding shortest path is 
the same as in the original graph, and is already present in T r . For computing shortest paths 
from r to vertices of Oi, we build a shortest path tree (denoted as T r (Oi)) from r in the 
subgraph induced by vertices Oi U {r} and the following additional edges. For each z G Oi 
with at least one neighbor in Ui, we add an edge (r, z) with weight = rain^^^g ^jj.{5{r, u)+ 
u(u,z)). Applying Lemma 3.1, let (yi,Zi) be the edge of P(r, Xj+i, Xj) joining the sub 
path present in Ui U Oi with the sub path present in Di. This edge can be identified 
using the fact that this is the edge which minimizes 5i{r,y) + uj{y,z) + 5(xi+i,z) over all 
z G Di,y G Ui U Oi,{y,z) G E. The vertex Xj+i stores the path P(r, Xj+i, Xj) implicitly 
by keeping the edge {y-i,Zi) and the tree T r (Oi). The shortest path P(r, Xi + \,Xi) can be 
retrieved in optimal time using the trees T r , T r (Oi), and the edge (yj+i,Zj+i). Due to 
mutual disjointness of Oj's, the overall space requirement of the data structure for retrieving 
P(r, Xi + \,Xi) for all i < k will be 0{n). 

3.1.2. Data structure for 3-approximate shortest paths to vertices of Oi when Xi has failed. 
In order to compute 3-approximate shortest path to Oi upon failure of Xj, we shall use the 
approximate shortest paths to Di as computed above. Here we use an interesting observation 
which states that if we have a data structure to retrieve a-approximate shortest paths from 
r to vertices of Di when x« fails, then we can use it to have a data-structure to retrieve 
a-approximate shortest paths to vertices of Oi as well. To prove this result, this is how 
we proceed. Consider the subgraph induced by Oi and augmented with vertex r and some 
extra edges which are defined as follows. 
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• For each oGOj having neighbors from Ui, keep an edge (r,o) and assign it weight 
= ^(u,o)eE,ueUi(S{r,u) +u(u,o)). 

• For each o € Oi having neighbors from Di, keep an edge (r, o) and assign it weight = 
mm( u o j eE ueDj (5(r,u,Xi) + uj(u,o)), where 5(r, u,Xi) is the a-approximate distance 
to u upon failure of Sj. (In the present situation we have a = 3.) 

Let us denote this graph as G r (Oi). Observation 13.31 is based on the following lemma which 
is easy to prove. 

Lemma 3.2. The Dijkstra's algorithm from r in the graph G r (Oj) computes a-approximate 
shortest paths from r to all v 6 Oi avoiding X{. 

Observation 3.3. If we can design a data structure for retrieving (1 + e)-approximate 
shortest paths from r to vertices of Di upon failure of Xi, then it can also be used to design 
a data structure which can support retrieval of (1 + e)-approximate shortest paths to all 
vertices of the graph upon failure of Xi. 

We compute and store the shortest path tree rooted at r in the graph G r {Oi). This tree 
along with the tree T r and the data structure described in the previous sub-section suffice 
for retrieval of 3- approximate shortest paths to o £ Oi upon failure of x%. 
Query answering: Suppose the oracle receives a query asking for approximate shortest 
path from r to v avoiding Xi € P(r, i). It first invokes lowest common ancestor (LCA) 
query between v and Xi on T r . If LCA(v,Xi) ^ Xi, the shortest path from r to v remains 
unaffected and so it reports the path P(r, t). Otherwise, it determines if v € Di or v € Oi. 
Depending upon the two cases, it reports the approximate shortest path between r and Vi 
using one of the two data structures described above. 

Theorem 3.4. An undirected weighted graph G = (V,E), a source r € V, and a shortest 
path P € T r can be processed in 0(m + n log n) time to build a data structure of 0{n) space 
which can report 3- approximate shortest path from r to any v € V avoiding any single failed 
vertex from P. 

3.2. Handling the failure of any vertex in T r 

We follow divide and conquer strategy based on the following simple lemma. 

Lemma 3.5. There exists an 0(n) time algorithm to compute a path P in T r whose removal 
splits T r into a collection of disjoint subtrees T r (vi), ...T r (vj) such that 

• |T r (uj)| < re/2 for each i < j. 

• P Ui T r { Vi ) =T and PC) T r ( Vi ) = Vi. 

First we compute the path P € T r as mentioned in Lemma 13.51 We build the data 
structure for handling failure of any vertex from P by executing the algorithm of Theorem 
13.41 Let vi, Vj be the roots of the sub trees of T r connected to the path P with an edge. 
For each 1 < i < j, we solve the problem recursively on the subgraph G r (vi), and build 
the corresponding data structures. Lemma 13.51 and Theorem 13.41 can be used in straight 
forward manner to prove the following theorem. 

Theorem 3.6. An undirected weighted graph G = (V,E) can be processed in 0(m log n + 
re log 2 re) time to build a data structure of size 0(n log re) which can answer, in optimal time, 
any 3-approximate shortest path query from a given source r to any vertex v £ V avoiding 
any single failed vertex. 
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4. Single source (l+e)-approximate shortest paths avoiding a failed vertex 

In this section, we shall present a compact data structure for single source (1 + e)- 
approximate shortest paths avoiding a failed vertex in an unweighted graph. Let level(v) 
denote the level (or distance from r) of vertex v in the tree T r . Let U x , D x ,O x denote 
the partitions of the tree T r formed by deletion of vertex x, with the same meaning as 
that of Ui,Di,Oi defined for Xi in the previous section. On the basis of Observation 13. 3| 
our objective is to build a compact data structure which will support retrieval of (1 + e)- 
approximate shortest-paths to vertices of D x upon failure of x for any x G V. Let uchild(x) 
denote the root of the subtree corresponding to D x (it is similar to Xj+i in case of Di). For 
reporting approximate distance between r and v G D x when x fails, the data structure of 
previous section reports path of length 5(r, uchild(2;), x)+<5(uchild(x), v) which is bounded 
by 5(r, v,x) + 25(uchild(x),v). It should be noted that the approximation factor associated 
with it is already bounded by (1 + e) for any e > if the following condition holds. 
C : uchild(x) is close to v, that is, <5(uchild(x), v) < §<5(r, v). 

We shall build a supplementary data structure which will ensure that whenever the 
condition C does not hold, there will be some ancestor w of v lying on ~P(x,v), called a 
special vertex, satisfying the following two properties. 

(1) 5(w,v) <C 5(r,v), that is w is much closer to v than r. 

(2) vertex w stores approximate shortest path to r avoiding x (with the approximation 
factor arbitrarily close to 1). 

We shall refer to such vertices w as special- vertices. 

4.1. Constructing the set of special vertices 

Let h be the height of BFS tree rooted at r. Let L be a set of integers such that 
L = {*IL(1 + e T\ < M- For a given i G L, we define a subset S{ of special vertices as 
Si = {u € V\level(u) = [(1 + e)*J A |T r (n)| > elevel(u)}. We define the set of special 
vertices as S = UvieL'S'i- In addition, we also introduce the following terminologies. 

• S(v): the nearest ancestor of v which belongs to set S. 

• V(u): For a vertex u € S, V(u) denotes the set of vertices v E V with S(v) = u. 
In essence, the vertex u will serve as the special vertex for each vertex from V(u). 
For failure of any vertex x £ P(r, u), each vertex of set V(u) will query the data 
structure stored at u for retrieval of approximate shortest path/distance from the 
source. 

We now state two simple lemmas based on the above construction. 

Lemma 4.1. Let v G V\S, then 5(v,S(v)) < (-^)level(v) if e < 1 

Lemma 4.2. Let u be a vertex at level I and u € S. Then V(u) > e£. 

If we can ensure that the data structure for a special vertex u (for retrieving approximate 
shortest paths from r upon failure of any x G P(r, u)) is of size 0(level(u)), then it would 
follow from Lemma 14.21 that the space required by our supplementary data structure will 
be linear in n. 
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4.2. The data structure for a special vertex 

Consider a special vertex v with level(v) = |_(1 + e)*J We shall now describe a compact 
data structure stored at v which will facilitate retrieval of approximate shortest path from 
r to v upon failure of any vertex x £ P(r, w). 

Let v' be the special vertex which is present at level [(1 + e )* _1 J an d is ancestor of v. 
The data structure stored at v will be defined in a way that will prevent it from storing 
information that is already present in the data structure of some special vertex lying on 
P(r, v'). 

If x G P(v',v), then the data structure described in the previous section itself stores a 
path which is (1 + 2e)-approximation of P(r, v , x). 

Let us now consider the nontrivial case when x G P(V, v'), x ^ v' . In order to discuss 
this case, we would like to introduce the notion of detour. To understand it, let us visualize 
the paths P(r, v,x) and ~P(r,v) simultaneously. Since P(r, v,x) and P(r, v) have the same 
end-points and x doesn't lie on P(r, v, x), there must be a middle portion of P(r, v, x) which 
intersects P(r, v) at exactly two vertices, and the remaining portion of P(r, v,x) overlaps 
with P(r, v). This middle portion is called a detour. We now define it more formally. Let 
a and b be two vertices on the shortest path P(r, v). We use a -< b to denote that vertex a 
is closer to r than vertex b. The notation a -< b would mean that either a -< b or a = 6. . 
So here is the definition of detour (and the underlying observation). 

Definition 4.3. Let x € P(r, y). When x fails, the path P(r, y, x) will be of the form 
of P(r, a) :: :: P(6, y), where r<a^x<b<y and the path p a ^ is such that 
p a fi nP(a, 6) = {a,b}. In other words, p aj h meets P(a, 6) only at the end points. We shall 
call p a ,b as the detour associated with the shortest path P(r, y,x). 

Let p at b represent the detour w.r.t. to P(r, v,x). The handling of failure of vertices 
x E P(r,v) which lie above v' would depend upon the detour p a j,. This detour can be of 
any of the following types (see Figure [2] for illustration). 

• I : b < v'. 

• ll:v'<b. 




L(i + e) l J 




(i) 

Figure 2: pa^ is shortest detour of P(r, v,x). (i) : detour of type I, (ii) : detour of type II 



Handling detours of type I is relatively easy. Let w be the farthest ancestor of v such 
that w G S and level of w is greater or equal to the level of b. In this case, v stores the 
corresponding detour implicitly by just keeping a pointer to the vertex w. 

Handling detours of type II is slightly tricky since we can't afford to store each of them 
explicitly. However, we shall employ the following observation associated with the detours 
of type II to guarantee low space requirement. 
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Observation 4.4. Let a%, ct2,- ■ ■ ,at be the vertices on P(r, v) (in the increasing order of 
their levels) such that the shortest detour corresponding to P(r, v, ai) is of type II Vi, then 

8(r,v,ati) > 5(r,v,a 2 ) >■■■> 5(r,v,a t ) 

It follows from the above observation that if 8(r, v,oti) < (1 + e)8(r, v, ay) for any i < j, 
then P(r, v, an) may as well serve as (1 + e)-approximate shortest path from r to v avoiding 
aj. In other words, we need not store the detour associated with P(r, v, ay) in such situation. 
Using this observation, we shall have to explicitly store only 0(log 1+e n) detours of type II. 
Moreover, we do not store explicitly detours of type II whose length is much larger than 
level{v). Specifically, if P(r, v,x) > -level(v), then v will merely store pointer to the path 
P(r, uchild(x), x) :: P(uchild(x), v ). This ensures that each detour of type II which v has 
to explicitly store will have length 0(-level(v)). 

It follows from the above description that for a special vertex v and x G P(r, v), the data 
structure associated with v stores (1 + 2e)-approximation of the path P(r, v,x). Moreover, 
the total space required by the data structure associated with all the special vertices will be 
0(n^§^). This supplementary data structure combined with the data structure of previous 
section can report (1 + 6e)-approximation of P(r, z, x) for any z, x G V . 

Theorem 4.5. Given an undirected unweighted graph G = (V,E), source r G V, and any 
e > 0, we can build a data structure of size 0(n^^) that can report (1 + e)- approximate 
shortest path from r to any z G V avoiding any failed vertex in optimal time. 

5. All-pairs (2k — 1)(1 + e)-approx. distance oracle avoiding a failed vertex 

We start with a brief description of the approximate distance oracle of Thorup and 
Zwick |13j . The key idea to achieve sub-quadratic space is to store distance from each 
vertex to only a small set of vertices. For retrieving approximate distance between any two 
vertices u, v G V, it is ensured that there is a third vertex w which is close to both of them, 
and whose distance from both of them is known. To realize this idea, Thorup and Zwick 
|13| introduced two novel structures called ball and cluster which are defined for any two 
subsets A, B of vertices as follows, (here 8(v, B) denotes the distance between v and its 
nearest vertex from B). 

Ball(v, A, B) = {w£ A\8{v, w) < 8(v, B)} C{w, A, B) = {v G V\8(v, w) < 8(v, B)} 

Construction of (2k — l)-approximate distance oracle of Thorup and Zwick [13] employs a 
k-level hierarchy = (Aq 5 A\ 3 A%... 5 A^—\ D Ak) of subsets of vertices as follows. 
Aq = V, Ak = 0, and Ai + i for any i < k — 1 is formed by selecting each vertex from Ai 
independently with probability n" 1 ^ . 

The data structure associated with the (2k — l)-approximate distance oracle of Thorup 
and Zwick [13] stores for each vertex v €V the following information : 

• the vertices of set Di^Bal^v, A{, Ai + i) (and their distances). 

• the vertex from Ai nearest to v (to be denoted as Pi(v)). 

Due to randomization underlying the construction of the expected size of 

Ball(v, Ai, Ai + \) is Ofa 1 ^), and hence the space required by the oracle is 0(kn 1+1 ^ k ). 
We shall now outline the ideas in extending the (2k — l)-approximate distance oracle to 
handle single vertex failure. Kindly refer to the extended version [10] of this paper for 
complete details. 
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5.1. Overview of all-pairs approx. distance oracles avoiding a failed vertex 

Firstly the notations used by the static approximate distance oracle of |13| , in particular 
ball and cluster, get extended for single vertex failure in a natural manner as follows, (here 
5(v,B,x) is the distance between v and its nearest vertex from B in G\{x}). 

Ball x (v,A,B) = {w£ A\5(v,w,x) < 5(v,B,x)} 

C x (w,A,B) = {ve V\5{v,w,x) < 6(v,B,x)} 

Let p x {v) denote the vertex from Ai which is nearest to v in G\{x}. Along the lines of the 
static approximate distance oracle of Thorup and Zwick [13], the basic operation which the 
approximate distance oracle avoiding a failed vertex should support is the following : 

Report distance (exact or approximate) between v and w G Ai if w G Ball x {v , Ai, -Ai+i) 
for any given v,x G V . 

However, it can be observed that we would have to support this operation implicitly in- 
stead of explicitly keeping Ball x {v, Ai, Ai + \) for each v, x, i. Our starting point is the simple 
observation that clusters and balls are inverses of each others, that is, w G Ball x {v, Ai, Ai + \) 
is equivalent to v G C x {w, Ai, Ai + \). Now we make an important observation. Consider 
the subgraph Gi{w) induced by the vertices of set \J x& yC x [w, Ai, A4+1). This subgraph 
preserves the path P(w,v,x) for each x,v G V if w G Ball x (v, Ai, Ai + \). So it suffices 
to keep a single source (approximate) shortest paths oracle on Gi(w) with w as the root. 
Keeping this data structure for each w G Ai provides an implicit compact data structure for 
supporting the basic operation mentioned above. Using Theorem 14.51 it can be seen that 
the space required at a level i will be of the order of YlweA- I ^xeV C x (w, Ai, Ai + i)\, but 
it is not clear whether we can get an upper bound of the order of n 1+1 / fc on this quantity. 
Here, as a new tool, we introduce the notion of e-truncated balls and clusters. 

Definition 5.1. Given a vertex x, any subsets A,B, and e > 
Ball x (v, A, B, e) = iw G A\5{v,w,x) < 

Instead of dealing with the usual balls (and clusters) under deletion of single vertex, 
we deal with e-truncated balls (and clusters) under deletion of single vertex. We note that 
the inverse relationship between clusters and balls gets seamlessly extended to e-truncated 
balls and clusters under single vertex failure as well. That is, 

\U x£V C x {w,Ai,A i+1 ,e)\ = J2\U xGV Ball x (v,Ai,Ai +1 ,e)\ 

weAi vev 

So it suffices to get an upper bound on the size of the set U x ^.yBall x {v, Ai,Ai + \, e) for any 
vertex v G V. The following lemma states a very crucial property of e-truncated balls which 
leads to prove the existence of a small set S of 0{-^ logn) vertices such that 

U xGV Ball x {v, Ai,A i+l ,e) C U xeS Ball x {v, Ai,A i+1 ) U Ball(v, A h A i+X ) (5.1) 

Lemma 5.2. In a given graph G = (V,E), let v be any vertex and let u = pi+\(v). Let x\ 
and X2 be any two vertices on the P[v, u) path with x\ appearing closer to v on this path 
and 8(v, Ai + i, xi) < (1 + e)S(v, Ai+i, £2). Then 

Ball Xl (v,Ai,A i+1 ,e) C Ball(v, Ai, A i+1 ) U Ball X2 {v, Ai, A i+l ) 
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Proof. Let w be any vertex in A%. It suffices to show the following. If uu does not belong 
to Ball(v, Ai, U Ball X2 (v, Ai, Ai + i), then w does not belong to Ball Xl (v, A4, -Ai+i, e). 

The proof is based on the analysis of the following two cases. 
Case 1 : The vertex X2 is present in P(v,w,xi). 

Since, w £ Ball(v, Ai, Ai + \), therefore, S(v,w) is at least 5(v,u). Hence using triangle 
inequality, 5{v,X2) + 5{x2,w) > 5(v,u). Now S(v,u) = 5{v,X2) + 5{x2,u) (since X2 lies on 
P(v,u)). Hence 5(x2,w) > 5{x2,u). Moreover, since x\ does not appear on P(x2,n), so 
5(x2,u) = 5(X2,U, xi). So 

5(x 2 , w,xi) > 5(x 2 ,u, x\) (5.2) 
Now it is given that X2 £ P(v,w,x\), so P(v, w, X\) must be of the form P(v, X2, xi) :: 
P(x2,tf,xi), the length of which is at least 5(v,X2,xi) + 5(x2,u,x\) using Equation 15.21 
The latter quantity is at least S(v, u, x±) which by definition is at least 5(v, Aj+i, x%). Hence 
w £ Ball Xl (v, Ai, and therefore, w £ Ball Xl (v, Ai, Ai + \,e). 

Case 2 : The vertex X2 is not present in P{v,w,x\). 

In this case, 5(v,w,x\) = 5(v,w, {x\, X2}) > 5{v,w,X2). The value 5(v,w,X2) is in turn 
at least 5(v, Ai + i, X2) since w £ Ball X2 (v, Ai, Ai + i). It is given that 5(v, Ai + i, X2) > 

5(v,Ai+i,xi) ^ hen{;e condude that S(v,W,Xi) > ^iilM , Sow^ Boil* 1 («, ^,^1,6). 

■ 

We shall now outline the construction of a small set S of vertices which will satisfy 
Equation 15.11 Let u = pi + \{v) and let P(v,u) = v{= xq), x\, X((= u). Observe that 
U xg y Ball x (v, Ai, Ai+i, e) = Ui<j<iBall Xj (v, Ai, A{ + \, e). For any node x € P(u,v), let 
value(x) = 5{v, Ai + \,x), and let h be the maximum value of any node on this path. The 
set S is initially empty. 

Let a(l) be the largest index from [1,1] such that value{xi) > h/(l + e). It can 
be seen that for all j < a(l), 5{v, Ai + \,Xj) < (1 + e)5(v,Ai + i,x a ru). Therefore, it 
follows from Lemma [5.21 that for each vertex x G {x\, x a m}, Ball x (v, Ai, Ai + i,e) C 
Ball Xa W (v, Ai, A i+1 ) U Ball(v,Ai, A i+1 ). So we insert x Q ( 1 ) to S. Similarly a(2) € 
[a(l) + 1,^] be the greatest integer such that value (2W2)) > ^/(l + e ) 2 - We add x a ( 2 ) 
to S 1 , and so on. It can be seen that the set S constructed in this manner will satisfy 
Equation 15.11 and its size will be 0(log 1+e h) = O(^jp). 

It can be shown using elementary probability theory that for each x 6 V, the set 
Ball x (v, Ai, Ai + i) has size 0(n 1 / fc logn) with high probability. Therefore, the construction 
of the set S outlined above implies the following crucial bound for each v € V, i < k — 1 
which helps us design all-pairs approximate distance oracle avoiding a failed vertex. 

I U xev Ball x {v, Ai,A i+1 ,e)\ = O ^^^j 

Using this equation, and owing to inverse relationship between clusters and balls, it fol- 
lows that E w , Al I U, e v C x ( W ,A t ,A l+1 ,e)\ = O (^^j, Our all-pairs approximate 
distance oracle avoiding any failed vertex will keep the following data structures. 

• Letpf(v,e) denote a vertex w from A4 with 5(v, w, x) < (l+e)5(v,pf(v),x). We keep 
a data structure Nj Vi < k, using which we can retrieve pf (v, e). This data-structure 
is obtained by suitable augmentation of our single source (1 + e)-approximate oracle. 

• For each w G Ai, we keep our single source (1 + e)-approximate oracle in Gi(w,e) 
which is the subgraph induced by U x <=vC x (w, Ai, e). 
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It follows that the overall space required by the data structure will be 0(kn 1+1 / k lo ^ 4 n ). 
The query algorithm and the analysis on the stretch of the approximate distance reported 
by the oracle are similar in spirit to that of Thorup and Zwick [13] (see [10] for details). 

Theorem 5.3. Given an integer k > 1 and a fraction e > 0, an unweighted graph G = 
(V,E) can be processed to construct a data structure which can answer {2k — 1)(1 + e)- 
approximate distance query between any two nodes u € V and v € V avoiding any single 
failed vertex in 0{k) time. The total size of the data structure is 0{kn 1+1 l k n ). 

Future work, (i) Can we design a data structure for single source (l + e)-approx. shortest 
paths avoiding a failed vertex for weighted graphs ? Such a data structure will immediately 
extend our all-pairs approx. distance oracle avoiding a failed vertex to weighted graphs, 
(ii) How to design approx. distance oracles avoiding two or more failed vertices ? Recent 
work of Duan and Pettie [8], and Chechik et al. [5) provides additional motivation for this. 
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